Overview

Dataset statistics

Number of variables32
Number of observations20556
Missing cells157560
Missing cells (%)24.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.0 MiB
Average record size in memory461.3 B

Variable types

NUM28
CAT4

Reproduction

Analysis started2020-05-30 01:06:23.044699
Analysis finished2020-05-30 01:07:56.720007
Duration1 minute and 33.68 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

iso_code has a high cardinality: 211 distinct values High cardinality
location has a high cardinality: 212 distinct values High cardinality
date has a high cardinality: 151 distinct values High cardinality
new_cases is highly correlated with total_cases and 1 other fieldsHigh correlation
total_cases is highly correlated with new_cases and 2 other fieldsHigh correlation
total_deaths is highly correlated with total_casesHigh correlation
new_deaths is highly correlated with new_casesHigh correlation
total_tests is highly correlated with total_cases and 2 other fieldsHigh correlation
new_tests is highly correlated with total_tests and 1 other fieldsHigh correlation
new_tests_smoothed is highly correlated with total_tests and 1 other fieldsHigh correlation
new_tests_smoothed_per_thousand is highly correlated with new_tests_per_thousandHigh correlation
new_tests_per_thousand is highly correlated with new_tests_smoothed_per_thousandHigh correlation
aged_65_older is highly correlated with median_age and 1 other fieldsHigh correlation
median_age is highly correlated with aged_65_olderHigh correlation
aged_70_older is highly correlated with aged_65_olderHigh correlation
total_cases_per_million has 385 (1.9%) missing values Missing
new_cases_per_million has 385 (1.9%) missing values Missing
total_deaths_per_million has 385 (1.9%) missing values Missing
new_deaths_per_million has 385 (1.9%) missing values Missing
total_tests has 15041 (73.2%) missing values Missing
new_tests has 15644 (76.1%) missing values Missing
total_tests_per_thousand has 15041 (73.2%) missing values Missing
new_tests_per_thousand has 15644 (76.1%) missing values Missing
new_tests_smoothed has 14508 (70.6%) missing values Missing
new_tests_smoothed_per_thousand has 14508 (70.6%) missing values Missing
tests_units has 13909 (67.7%) missing values Missing
stringency_index has 4309 (21.0%) missing values Missing
population_density has 910 (4.4%) missing values Missing
median_age has 1863 (9.1%) missing values Missing
aged_65_older has 2115 (10.3%) missing values Missing
aged_70_older has 1957 (9.5%) missing values Missing
gdp_per_capita has 2122 (10.3%) missing values Missing
extreme_poverty has 8331 (40.5%) missing values Missing
cvd_death_rate has 1944 (9.5%) missing values Missing
diabetes_prevalence has 1259 (6.1%) missing values Missing
female_smokers has 5405 (26.3%) missing values Missing
male_smokers has 5569 (27.1%) missing values Missing
handwashing_facilities has 12421 (60.4%) missing values Missing
hospital_beds_per_100k has 3392 (16.5%) missing values Missing
new_cases_per_million is highly skewed (γ1 = 30.78732471) Skewed
new_deaths_per_million is highly skewed (γ1 = 23.52733283) Skewed
total_cases has 3332 (16.2%) zeros Zeros
new_cases has 8586 (41.8%) zeros Zeros
total_deaths has 8762 (42.6%) zeros Zeros
new_deaths has 14426 (70.2%) zeros Zeros
total_cases_per_million has 2975 (14.5%) zeros Zeros
new_cases_per_million has 8219 (40.0%) zeros Zeros
total_deaths_per_million has 8400 (40.9%) zeros Zeros
new_deaths_per_million has 14050 (68.3%) zeros Zeros
new_tests_smoothed_per_thousand has 237 (1.2%) zeros Zeros
stringency_index has 2171 (10.6%) zeros Zeros

Variables

iso_code
Categorical

HIGH CARDINALITY

Distinct count211
Unique (%)1.0%
Missing64
Missing (%)0.3%
Memory size160.7 KiB
CAN
 
151
BEL
 
151
DEU
 
151
SGP
 
151
AUT
 
151
Other values (206)
19737
ValueCountFrequency (%) 
CAN1510.7%
 
BEL1510.7%
 
DEU1510.7%
 
SGP1510.7%
 
AUT1510.7%
 
KOR1510.7%
 
RUS1510.7%
 
FIN1510.7%
 
GRC1510.7%
 
GBR1510.7%
 
IRN1510.7%
 
MYS1510.7%
 
BRA1510.7%
 
CZE1510.7%
 
ISL1510.7%
 
MEX1510.7%
 
FRA1510.7%
 
NPL1510.7%
 
AUS1510.7%
 
HRV1510.7%
 
ISR1510.7%
 
NLD1510.7%
 
CHE1510.7%
 
SWE1510.7%
 
NOR1510.7%
 
Other values (186)1671781.3%
 

Length

Max length8
Median length3
Mean length3.036728936
Min length3

Overview of Unicode Properties

Unique unicode characters29
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
R50828.1%
 
A48727.8%
 
N45007.2%
 
M36565.9%
 
L34385.5%
 
S33635.4%
 
E31415.0%
 
G31295.0%
 
B30244.8%
 
I27634.4%
 
T27104.3%
 
C25674.1%
 
U24774.0%
 
D23483.8%
 
O22063.5%
 
P18743.0%
 
H18132.9%
 
K17592.8%
 
Z13552.2%
 
W12242.0%
 
V12101.9%
 
Y11581.9%
 
F9791.6%
 
J5500.9%
 
X5100.8%
 
Other values (4)7151.1%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter6208099.5%
 
Lowercase Letter1920.3%
 
Connector Punctuation1510.2%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
R50828.2%
 
A48727.8%
 
N45007.2%
 
M36565.9%
 
L34385.5%
 
S33635.4%
 
E31415.1%
 
G31295.0%
 
B30244.9%
 
I27634.5%
 
T27104.4%
 
C25674.1%
 
U24774.0%
 
D23483.8%
 
O22063.6%
 
P18743.0%
 
H18132.9%
 
K17592.8%
 
Z13552.2%
 
W12242.0%
 
V12101.9%
 
Y11581.9%
 
F9791.6%
 
J5500.9%
 
X5100.8%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_151100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n12866.7%
 
a6433.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin6227299.8%
 
Common1510.2%
 

Most frequent Latin characters

ValueCountFrequency (%) 
R50828.2%
 
A48727.8%
 
N45007.2%
 
M36565.9%
 
L34385.5%
 
S33635.4%
 
E31415.0%
 
G31295.0%
 
B30244.9%
 
I27634.4%
 
T27104.4%
 
C25674.1%
 
U24774.0%
 
D23483.8%
 
O22063.5%
 
P18743.0%
 
H18132.9%
 
K17592.8%
 
Z13552.2%
 
W12242.0%
 
V12101.9%
 
Y11581.9%
 
F9791.6%
 
J5500.9%
 
X5100.8%
 
Other values (3)5640.9%
 

Most frequent Common characters

ValueCountFrequency (%) 
_151100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII62423100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
R50828.1%
 
A48727.8%
 
N45007.2%
 
M36565.9%
 
L34385.5%
 
S33635.4%
 
E31415.0%
 
G31295.0%
 
B30244.8%
 
I27634.4%
 
T27104.3%
 
C25674.1%
 
U24774.0%
 
D23483.8%
 
O22063.5%
 
P18743.0%
 
H18132.9%
 
K17592.8%
 
Z13552.2%
 
W12242.0%
 
V12101.9%
 
Y11581.9%
 
F9791.6%
 
J5500.9%
 
X5100.8%
 
Other values (4)7151.1%
 

location
Categorical

HIGH CARDINALITY

Distinct count212
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size160.7 KiB
Denmark
 
151
France
 
151
Canada
 
151
World
 
151
Norway
 
151
Other values (207)
19801
ValueCountFrequency (%) 
Denmark1510.7%
 
France1510.7%
 
Canada1510.7%
 
World1510.7%
 
Norway1510.7%
 
United Kingdom1510.7%
 
Vietnam1510.7%
 
Israel1510.7%
 
Nepal1510.7%
 
China1510.7%
 
United States1510.7%
 
Australia1510.7%
 
Croatia1510.7%
 
Estonia1510.7%
 
Iceland1510.7%
 
Netherlands1510.7%
 
Mexico1510.7%
 
Germany1510.7%
 
Italy1510.7%
 
Belgium1510.7%
 
Belarus1510.7%
 
Switzerland1510.7%
 
Austria1510.7%
 
Japan1510.7%
 
South Korea1510.7%
 
Other values (187)1678181.6%
 

Length

Max length32
Median length7
Mean length8.625267562
Min length4

Overview of Unicode Properties

Unique unicode characters56
Unique unicode categories (?)7
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a2665115.0%
 
i152158.6%
 
n150918.5%
 
e126147.1%
 
r103985.9%
 
o87304.9%
 
t70794.0%
 
l66033.7%
 
d60043.4%
 
u59593.4%
 
s58983.3%
 
56793.2%
 
c34702.0%
 
g34521.9%
 
m33671.9%
 
S29671.7%
 
b28031.6%
 
h26631.5%
 
y20221.1%
 
C19821.1%
 
M19791.1%
 
p19021.1%
 
B18961.1%
 
I18621.1%
 
A18081.0%
 
Other values (31)1920710.8%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter14586682.3%
 
Uppercase Letter2548514.4%
 
Space Separator56793.2%
 
Other Punctuation77< 0.1%
 
Open Punctuation65< 0.1%
 
Close Punctuation65< 0.1%
 
Dash Punctuation64< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
S296711.6%
 
C19827.8%
 
M19797.8%
 
B18967.4%
 
I18627.3%
 
A18087.1%
 
G15356.0%
 
N12254.8%
 
L11114.4%
 
P10624.2%
 
T9753.8%
 
R9683.8%
 
K9423.7%
 
E9413.7%
 
U8103.2%
 
F6502.6%
 
D5762.3%
 
V5702.2%
 
H4301.7%
 
J3781.5%
 
Z2911.1%
 
W1850.7%
 
Q1470.6%
 
O1450.6%
 
Y500.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a2665118.3%
 
i1521510.4%
 
n1509110.3%
 
e126148.6%
 
r103987.1%
 
o87306.0%
 
t70794.9%
 
l66034.5%
 
d60044.1%
 
u59594.1%
 
s58984.0%
 
c34702.4%
 
g34522.4%
 
m33672.3%
 
b28031.9%
 
h26631.8%
 
y20221.4%
 
p19021.3%
 
w13780.9%
 
z12610.9%
 
k10580.7%
 
v8670.6%
 
f4770.3%
 
j3160.2%
 
x2950.2%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
5679100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
'77100.0%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-64100.0%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(65100.0%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)65100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin17135196.6%
 
Common59503.4%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a2665115.6%
 
i152158.9%
 
n150918.8%
 
e126147.4%
 
r103986.1%
 
o87305.1%
 
t70794.1%
 
l66033.9%
 
d60043.5%
 
u59593.5%
 
s58983.4%
 
c34702.0%
 
g34522.0%
 
m33672.0%
 
S29671.7%
 
b28031.6%
 
h26631.6%
 
y20221.2%
 
C19821.2%
 
M19791.2%
 
p19021.1%
 
B18961.1%
 
I18621.1%
 
A18081.1%
 
G15350.9%
 
Other values (26)1740110.2%
 

Most frequent Common characters

ValueCountFrequency (%) 
567995.4%
 
'771.3%
 
(651.1%
 
)651.1%
 
-641.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII177301100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a2665115.0%
 
i152158.6%
 
n150918.5%
 
e126147.1%
 
r103985.9%
 
o87304.9%
 
t70794.0%
 
l66033.7%
 
d60043.4%
 
u59593.4%
 
s58983.3%
 
56793.2%
 
c34702.0%
 
g34521.9%
 
m33671.9%
 
S29671.7%
 
b28031.6%
 
h26631.5%
 
y20221.1%
 
C19821.1%
 
M19791.1%
 
p19021.1%
 
B18961.1%
 
I18621.1%
 
A18081.0%
 
Other values (31)1920710.8%
 

date
Categorical

HIGH CARDINALITY

Distinct count151
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size160.7 KiB
2020-05-21
 
211
2020-05-25
 
211
2020-05-24
 
211
2020-05-19
 
211
2020-05-26
 
211
Other values (146)
19501
ValueCountFrequency (%) 
2020-05-212111.0%
 
2020-05-252111.0%
 
2020-05-242111.0%
 
2020-05-192111.0%
 
2020-05-262111.0%
 
2020-05-202111.0%
 
2020-05-182111.0%
 
2020-05-152111.0%
 
2020-05-162111.0%
 
2020-05-232111.0%
 
2020-05-172111.0%
 
2020-05-222111.0%
 
2020-05-072101.0%
 
2020-05-052101.0%
 
2020-05-272101.0%
 
2020-05-282101.0%
 
2020-05-092101.0%
 
2020-05-042101.0%
 
2020-05-062101.0%
 
2020-05-082101.0%
 
2020-05-112101.0%
 
2020-05-122101.0%
 
2020-05-132101.0%
 
2020-05-142101.0%
 
2020-05-022101.0%
 
Other values (126)1529474.4%
 

Length

Max length10
Median length10
Mean length10
Min length10

Overview of Unicode Properties

Unique unicode characters11
Unique unicode categories (?)2
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
06895833.5%
 
25260425.6%
 
-4111220.0%
 
1113005.5%
 
481994.0%
 
581404.0%
 
368463.3%
 
921651.1%
 
720841.0%
 
820841.0%
 
620681.0%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number16444880.0%
 
Dash Punctuation4111220.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
06895841.9%
 
25260432.0%
 
1113006.9%
 
481995.0%
 
581404.9%
 
368464.2%
 
921651.3%
 
720841.3%
 
820841.3%
 
620681.3%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-41112100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common205560100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
06895833.5%
 
25260425.6%
 
-4111220.0%
 
1113005.5%
 
481994.0%
 
581404.0%
 
368463.3%
 
921651.1%
 
720841.0%
 
820841.0%
 
620681.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII205560100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
06895833.5%
 
25260425.6%
 
-4111220.0%
 
1113005.5%
 
481994.0%
 
581404.0%
 
368463.3%
 
921651.1%
 
720841.0%
 
820841.0%
 
620681.0%
 

total_cases
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count5620
Unique (%)27.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19342.057793345008
Minimum0
Maximum5776934
Zeros3332
Zeros (%)16.2%
Memory size160.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q16
median99
Q31329.25
95-th percentile38690.25
Maximum5776934
Range5776934
Interquartile range (IQR)1323.25

Descriptive statistics

Standard deviation198317.8553
Coefficient of variation (CV)10.25319319
Kurtosis438.5225615
Mean19342.05779
Median Absolute Deviation (MAD)99
Skewness19.62476663
Sum397595340
Variance3.932997174e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0333216.2%
 
16243.0%
 
33461.7%
 
23161.5%
 
112931.4%
 
182501.2%
 
162411.2%
 
62211.1%
 
72041.0%
 
82031.0%
 
51940.9%
 
41750.9%
 
151740.8%
 
121730.8%
 
101720.8%
 
191490.7%
 
141440.7%
 
91430.7%
 
131320.6%
 
251090.5%
 
241080.5%
 
39800.4%
 
21770.4%
 
17750.4%
 
22730.4%
 
Other values (5595)1254861.0%
 
ValueCountFrequency (%) 
0333216.2%
 
16243.0%
 
23161.5%
 
33461.7%
 
41750.9%
 
51940.9%
 
62211.1%
 
72041.0%
 
82031.0%
 
91430.7%
 
ValueCountFrequency (%) 
57769341< 0.1%
 
56577521< 0.1%
 
55561301< 0.1%
 
54602541< 0.1%
 
53711581< 0.1%
 
52769421< 0.1%
 
51758361< 0.1%
 
50692621< 0.1%
 
49613381< 0.1%
 
48619751< 0.1%
 

new_cases
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct count1856
Unique (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean562.067912045145
Minimum-2461
Maximum119182
Zeros8586
Zeros (%)41.8%
Memory size160.7 KiB

Quantile statistics

Minimum-2461
5-th percentile0
Q10
median2
Q346
95-th percentile1144.5
Maximum119182
Range121643
Interquartile range (IQR)46

Descriptive statistics

Standard deviation5007.224301
Coefficient of variation (CV)8.908575269
Kurtosis258.4626488
Mean562.067912
Median Absolute Deviation (MAD)2
Skewness15.3614804
Sum11553868
Variance25072295.2
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0858641.8%
 
111365.5%
 
26963.4%
 
34682.3%
 
43871.9%
 
63101.5%
 
53011.5%
 
72241.1%
 
82201.1%
 
92051.0%
 
101991.0%
 
111710.8%
 
121540.7%
 
151500.7%
 
131350.7%
 
141260.6%
 
171080.5%
 
161080.5%
 
191010.5%
 
18980.5%
 
21910.4%
 
22880.4%
 
20860.4%
 
27750.4%
 
28720.4%
 
Other values (1831)626130.5%
 
ValueCountFrequency (%) 
-24611< 0.1%
 
-14801< 0.1%
 
-7131< 0.1%
 
-5251< 0.1%
 
-3721< 0.1%
 
-2091< 0.1%
 
-1611< 0.1%
 
-1151< 0.1%
 
-1051< 0.1%
 
-501< 0.1%
 
ValueCountFrequency (%) 
1191821< 0.1%
 
1079241< 0.1%
 
1065741< 0.1%
 
1016221< 0.1%
 
1011061< 0.1%
 
1005481< 0.1%
 
993631< 0.1%
 
980331< 0.1%
 
966651< 0.1%
 
958761< 0.1%
 

total_deaths
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count1922
Unique (%)9.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1275.1473535707337
Minimum0
Maximum360089
Zeros8762
Zeros (%)42.6%
Memory size160.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q328
95-th percentile1765.25
Maximum360089
Range360089
Interquartile range (IQR)28

Descriptive statistics

Standard deviation13366.44884
Coefficient of variation (CV)10.48227784
Kurtosis414.8385606
Mean1275.147354
Median Absolute Deviation (MAD)2
Skewness19.15974336
Sum26211929
Variance178661954.7
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0876242.6%
 
114857.2%
 
27053.4%
 
35882.9%
 
53811.9%
 
43761.8%
 
63391.6%
 
73001.5%
 
102951.4%
 
92911.4%
 
82701.3%
 
121560.8%
 
111560.8%
 
211210.6%
 
131090.5%
 
141060.5%
 
15990.5%
 
17950.5%
 
20920.4%
 
26780.4%
 
24780.4%
 
19770.4%
 
22760.4%
 
18760.4%
 
16720.4%
 
Other values (1897)537326.1%
 
ValueCountFrequency (%) 
0876242.6%
 
114857.2%
 
27053.4%
 
35882.9%
 
43761.8%
 
53811.9%
 
63391.6%
 
73001.5%
 
82701.3%
 
92911.4%
 
ValueCountFrequency (%) 
3600891< 0.1%
 
3553561< 0.1%
 
3502131< 0.1%
 
3462771< 0.1%
 
3428941< 0.1%
 
3420781< 0.1%
 
3380891< 0.1%
 
3333991< 0.1%
 
3279571< 0.1%
 
3231561< 0.1%
 

new_deaths
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct count586
Unique (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.03492897450866
Minimum-1918
Maximum10520
Zeros14426
Zeros (%)70.2%
Memory size160.7 KiB

Quantile statistics

Minimum-1918
5-th percentile0
Q10
median0
Q31
95-th percentile58
Maximum10520
Range12438
Interquartile range (IQR)1

Descriptive statistics

Standard deviation332.876747
Coefficient of variation (CV)9.501282198
Kurtosis321.0569448
Mean35.03492897
Median Absolute Deviation (MAD)0
Skewness16.60452539
Sum720178
Variance110806.9287
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
01442670.2%
 
114737.2%
 
27403.6%
 
34422.2%
 
43471.7%
 
52341.1%
 
62061.0%
 
71670.8%
 
81400.7%
 
101190.6%
 
91170.6%
 
11900.4%
 
13710.3%
 
12710.3%
 
15630.3%
 
16590.3%
 
14560.3%
 
17460.2%
 
18420.2%
 
21390.2%
 
22340.2%
 
28320.2%
 
19310.2%
 
25300.1%
 
23280.1%
 
Other values (561)14537.1%
 
ValueCountFrequency (%) 
-19181< 0.1%
 
01442670.2%
 
114737.2%
 
27403.6%
 
34422.2%
 
43471.7%
 
52341.1%
 
62061.0%
 
71670.8%
 
81400.7%
 
ValueCountFrequency (%) 
105201< 0.1%
 
87091< 0.1%
 
85681< 0.1%
 
76631< 0.1%
 
76041< 0.1%
 
74451< 0.1%
 
74441< 0.1%
 
72841< 0.1%
 
72211< 0.1%
 
66711< 0.1%
 

total_cases_per_million
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count11302
Unique (%)56.0%
Missing385
Missing (%)1.9%
Infinite0
Infinite (%)0.0%
Mean538.8408424470774
Minimum0.0
Maximum19741.881999999998
Zeros2975
Zeros (%)14.5%
Memory size160.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10.825
median33.94
Q3274.124
95-th percentile2983.7155
Maximum19741.882
Range19741.882
Interquartile range (IQR)273.299

Descriptive statistics

Standard deviation1535.798857
Coefficient of variation (CV)2.850190142
Kurtosis53.47733858
Mean538.8408424
Median Absolute Deviation (MAD)33.94
Skewness6.261412086
Sum10868958.63
Variance2358678.13
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0297514.5%
 
0.034670.3%
 
199.973570.3%
 
63.049550.3%
 
222.25500.2%
 
193.757500.2%
 
2.611490.2%
 
17.046480.2%
 
2200.44470.2%
 
45.269460.2%
 
6.297460.2%
 
111.857450.2%
 
10.997400.2%
 
7.297400.2%
 
281.997400.2%
 
0.06390.2%
 
20.079390.2%
 
0.894380.2%
 
0.014360.2%
 
3826.87360.2%
 
0.047350.2%
 
18.203350.2%
 
10.045340.2%
 
3732.415340.2%
 
2176.364340.2%
 
Other values (11277)1615678.6%
 
(Missing)3851.9%
 
ValueCountFrequency (%) 
0297514.5%
 
0.0015< 0.1%
 
0.002280.1%
 
0.0037< 0.1%
 
0.0042< 0.1%
 
0.0058< 0.1%
 
0.0064< 0.1%
 
0.0071< 0.1%
 
0.008220.1%
 
0.009120.1%
 
ValueCountFrequency (%) 
19741.8821< 0.1%
 
19653.4861< 0.1%
 
19624.022< 0.1%
 
19594.5552< 0.1%
 
19476.6931< 0.1%
 
19388.2961< 0.1%
 
19329.3651< 0.1%
 
19299.91< 0.1%
 
19270.4342< 0.1%
 
19240.9691< 0.1%
 

new_cases_per_million
Real number (ℝ)

MISSING
SKEWED
ZEROS

Distinct count6531
Unique (%)32.4%
Missing385
Missing (%)1.9%
Infinite0
Infinite (%)0.0%
Mean13.514348173119826
Minimum-265.189
Maximum4944.376
Zeros8219
Zeros (%)40.0%
Memory size160.7 KiB

Quantile statistics

Minimum-265.189
5-th percentile0
Q10
median0.28
Q36.0745
95-th percentile64.47
Maximum4944.376
Range5209.565
Interquartile range (IQR)6.0745

Descriptive statistics

Standard deviation63.3846036
Coefficient of variation (CV)4.690170979
Kurtosis1936.483265
Mean13.51434817
Median Absolute Deviation (MAD)0.28
Skewness30.78732471
Sum272597.917
Variance4017.607974
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0821940.0%
 
0.207300.1%
 
2.543270.1%
 
0.01270.1%
 
0.392260.1%
 
0.042240.1%
 
0.017240.1%
 
0.588240.1%
 
0.061230.1%
 
0.196230.1%
 
0.018230.1%
 
0.052220.1%
 
0.009210.1%
 
0.026210.1%
 
0.067200.1%
 
0.251190.1%
 
0.001190.1%
 
0.06190.1%
 
0.305190.1%
 
0.084180.1%
 
0.144180.1%
 
0.043180.1%
 
0.078180.1%
 
0.414170.1%
 
0.057170.1%
 
Other values (6506)1143555.6%
 
(Missing)3851.9%
 
ValueCountFrequency (%) 
-265.1891< 0.1%
 
-139.4881< 0.1%
 
-83.8861< 0.1%
 
-38.571< 0.1%
 
-17.241< 0.1%
 
-15.7891< 0.1%
 
-15.251< 0.1%
 
-7.9561< 0.1%
 
-7.7341< 0.1%
 
-2.8341< 0.1%
 
ValueCountFrequency (%) 
4944.3761< 0.1%
 
1722.6531< 0.1%
 
1473.4471< 0.1%
 
1236.0948< 0.1%
 
1060.7581< 0.1%
 
1001.8271< 0.1%
 
861.3261< 0.1%
 
854.4991< 0.1%
 
800.161< 0.1%
 
736.6373< 0.1%
 

total_deaths_per_million
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count5150
Unique (%)25.5%
Missing385
Missing (%)1.9%
Infinite0
Infinite (%)0.0%
Mean23.56194705269942
Minimum0.0
Maximum1237.5510000000002
Zeros8400
Zeros (%)40.9%
Memory size160.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.234
Q35.161
95-th percentile110.228
Maximum1237.551
Range1237.551
Interquartile range (IQR)5.161

Descriptive statistics

Standard deviation95.75791243
Coefficient of variation (CV)4.064091657
Kurtosis68.69972324
Mean23.56194705
Median Absolute Deviation (MAD)0.234
Skewness7.407969188
Sum475268.034
Variance9169.577793
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0840040.9%
 
0.414700.3%
 
15.216700.3%
 
6.094640.3%
 
2.286620.3%
 
0.084610.3%
 
0.352580.3%
 
0.425580.3%
 
1.705580.3%
 
5.716560.3%
 
0.061560.3%
 
26.221550.3%
 
25.828540.3%
 
5.03520.3%
 
34.748520.3%
 
29.624480.2%
 
0.215430.2%
 
33.072410.2%
 
30.635390.2%
 
29.304390.2%
 
0.269390.2%
 
1.799380.2%
 
0.009370.2%
 
27.972360.2%
 
0.026360.2%
 
Other values (5125)1054951.3%
 
(Missing)3851.9%
 
ValueCountFrequency (%) 
0840040.9%
 
0.001130.1%
 
0.0027< 0.1%
 
0.0035< 0.1%
 
0.0042< 0.1%
 
0.00510< 0.1%
 
0.0061< 0.1%
 
0.0073< 0.1%
 
0.008140.1%
 
0.009370.2%
 
ValueCountFrequency (%) 
1237.5516< 0.1%
 
1208.085270.1%
 
1178.625< 0.1%
 
1149.1544< 0.1%
 
1119.6891< 0.1%
 
1060.7583< 0.1%
 
1031.2922< 0.1%
 
1001.8274< 0.1%
 
942.8964< 0.1%
 
883.9651< 0.1%
 

new_deaths_per_million
Real number (ℝ)

MISSING
SKEWED
ZEROS

Distinct count1685
Unique (%)8.4%
Missing385
Missing (%)1.9%
Infinite0
Infinite (%)0.0%
Mean0.5719709483912548
Minimum-41.023
Maximum200.04
Zeros14050
Zeros (%)68.3%
Memory size160.7 KiB

Quantile statistics

Minimum-41.023
5-th percentile0
Q10
median0
Q30.062
95-th percentile2.2225
Maximum200.04
Range241.063
Interquartile range (IQR)0.062

Descriptive statistics

Standard deviation3.528852488
Coefficient of variation (CV)6.169635885
Kurtosis975.0926679
Mean0.5719709484
Median Absolute Deviation (MAD)0
Skewness23.52733283
Sum11537.226
Variance12.45279988
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
01405068.3%
 
0.099620.3%
 
0.039440.2%
 
0.347420.2%
 
0.288360.2%
 
0.02350.2%
 
0.48340.2%
 
0.088330.2%
 
0.147320.2%
 
0.154310.2%
 
0.096300.1%
 
0.196300.1%
 
0.034290.1%
 
0.038290.1%
 
0.03290.1%
 
0.031280.1%
 
0.048270.1%
 
0.137270.1%
 
0.026260.1%
 
0.244260.1%
 
0.144260.1%
 
0.024250.1%
 
0.041250.1%
 
0.027240.1%
 
0.029240.1%
 
Other values (1660)536726.1%
 
(Missing)3851.9%
 
ValueCountFrequency (%) 
-41.0231< 0.1%
 
01405068.3%
 
0.001210.1%
 
0.002110.1%
 
0.003140.1%
 
0.00410< 0.1%
 
0.005210.1%
 
0.006190.1%
 
0.0076< 0.1%
 
0.008200.1%
 
ValueCountFrequency (%) 
200.041< 0.1%
 
176.7931< 0.1%
 
117.8621< 0.1%
 
93.2791< 0.1%
 
88.3961< 0.1%
 
58.9318< 0.1%
 
58.8011< 0.1%
 
51.771< 0.1%
 
50.9631< 0.1%
 
47.391< 0.1%
 

total_tests
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count5287
Unique (%)95.9%
Missing15041
Missing (%)73.2%
Infinite0
Infinite (%)0.0%
Mean274596.92239347234
Minimum1.0
Maximum15192481.0
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum1
5-th percentile306.1
Q19266
median47816
Q3172136.5
95-th percentile966647.3
Maximum15192481
Range15192480
Interquartile range (IQR)162870.5

Descriptive statistics

Standard deviation995644.7584
Coefficient of variation (CV)3.625840923
Kurtosis90.85502029
Mean274596.9224
Median Absolute Deviation (MAD)45494
Skewness8.622568735
Sum1514402027
Variance9.91308485e+11
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1114< 0.1%
 
5764< 0.1%
 
334< 0.1%
 
314< 0.1%
 
3264< 0.1%
 
164< 0.1%
 
254< 0.1%
 
5843< 0.1%
 
11453< 0.1%
 
6203< 0.1%
 
523< 0.1%
 
2113< 0.1%
 
363< 0.1%
 
4503< 0.1%
 
143< 0.1%
 
623< 0.1%
 
353< 0.1%
 
42983< 0.1%
 
6093< 0.1%
 
843< 0.1%
 
573< 0.1%
 
453< 0.1%
 
8803< 0.1%
 
203< 0.1%
 
53< 0.1%
 
Other values (5262)543326.4%
 
(Missing)1504173.2%
 
ValueCountFrequency (%) 
12< 0.1%
 
21< 0.1%
 
32< 0.1%
 
41< 0.1%
 
53< 0.1%
 
92< 0.1%
 
101< 0.1%
 
112< 0.1%
 
122< 0.1%
 
132< 0.1%
 
ValueCountFrequency (%) 
151924811< 0.1%
 
149070411< 0.1%
 
146049421< 0.1%
 
141636941< 0.1%
 
137847861< 0.1%
 
134190581< 0.1%
 
130247621< 0.1%
 
126082161< 0.1%
 
122028121< 0.1%
 
118060211< 0.1%
 

new_tests
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count3514
Unique (%)71.5%
Missing15644
Missing (%)76.1%
Infinite0
Infinite (%)0.0%
Mean10657.82756514658
Minimum1.0
Maximum441248.0
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum1
5-th percentile32
Q1584.75
median1992
Q36542.25
95-th percentile38464.35
Maximum441248
Range441247
Interquartile range (IQR)5957.5

Descriptive statistics

Standard deviation35388.57175
Coefficient of variation (CV)3.320430128
Kurtosis55.74652196
Mean10657.82757
Median Absolute Deviation (MAD)1798
Skewness6.897168068
Sum52351249
Variance1252351010
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1180.1%
 
2140.1%
 
5130.1%
 
8120.1%
 
1010< 0.1%
 
1610< 0.1%
 
310< 0.1%
 
49< 0.1%
 
79< 0.1%
 
99< 0.1%
 
7228< 0.1%
 
188< 0.1%
 
158< 0.1%
 
248< 0.1%
 
298< 0.1%
 
368< 0.1%
 
328< 0.1%
 
227< 0.1%
 
257< 0.1%
 
4437< 0.1%
 
67< 0.1%
 
136< 0.1%
 
456< 0.1%
 
336< 0.1%
 
276< 0.1%
 
Other values (3489)469022.8%
 
(Missing)1564476.1%
 
ValueCountFrequency (%) 
1180.1%
 
2140.1%
 
310< 0.1%
 
49< 0.1%
 
5130.1%
 
67< 0.1%
 
79< 0.1%
 
8120.1%
 
99< 0.1%
 
1010< 0.1%
 
ValueCountFrequency (%) 
4412481< 0.1%
 
4165461< 0.1%
 
4054041< 0.1%
 
4028081< 0.1%
 
3967911< 0.1%
 
3942961< 0.1%
 
3846441< 0.1%
 
3789081< 0.1%
 
3767511< 0.1%
 
3657281< 0.1%
 

total_tests_per_thousand
Real number (ℝ≥0)

MISSING

Distinct count4033
Unique (%)73.1%
Missing15041
Missing (%)73.2%
Infinite0
Infinite (%)0.0%
Mean12.190792747053493
Minimum0.0
Maximum176.11700000000002
Zeros26
Zeros (%)0.1%
Memory size160.7 KiB

Quantile statistics

Minimum0
5-th percentile0.011
Q10.3985
median2.746
Q314.5555
95-th percentile51.6989
Maximum176.117
Range176.117
Interquartile range (IQR)14.157

Descriptive statistics

Standard deviation22.29090843
Coefficient of variation (CV)1.8285036
Kurtosis17.45505148
Mean12.19079275
Median Absolute Deviation (MAD)2.708
Skewness3.643057346
Sum67232.222
Variance496.8845986
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.001430.2%
 
0.002340.2%
 
0.003280.1%
 
0.007280.1%
 
0260.1%
 
0.004240.1%
 
0.008220.1%
 
0.01220.1%
 
0.015210.1%
 
0.005190.1%
 
0.013170.1%
 
0.016160.1%
 
0.02160.1%
 
0.027150.1%
 
0.018150.1%
 
0.024150.1%
 
0.012150.1%
 
0.006140.1%
 
0.017140.1%
 
0.009120.1%
 
0.011110.1%
 
0.01410< 0.1%
 
0.0559< 0.1%
 
0.0219< 0.1%
 
0.0239< 0.1%
 
Other values (4008)505124.6%
 
(Missing)1504173.2%
 
ValueCountFrequency (%) 
0260.1%
 
0.001430.2%
 
0.002340.2%
 
0.003280.1%
 
0.004240.1%
 
0.005190.1%
 
0.006140.1%
 
0.007280.1%
 
0.008220.1%
 
0.009120.1%
 
ValueCountFrequency (%) 
176.1171< 0.1%
 
175.0561< 0.1%
 
174.571< 0.1%
 
173.0291< 0.1%
 
172.3521< 0.1%
 
172.3161< 0.1%
 
172.1471< 0.1%
 
171.0921< 0.1%
 
170.7081< 0.1%
 
170.5031< 0.1%
 

new_tests_per_thousand
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count1304
Unique (%)26.5%
Missing15644
Missing (%)76.1%
Infinite0
Infinite (%)0.0%
Mean0.397878664495114
Minimum0.0
Maximum7.285
Zeros171
Zeros (%)0.8%
Memory size160.7 KiB

Quantile statistics

Minimum0
5-th percentile0.001
Q10.031
median0.156
Q30.55025
95-th percentile1.498
Maximum7.285
Range7.285
Interquartile range (IQR)0.51925

Descriptive statistics

Standard deviation0.6108690395
Coefficient of variation (CV)1.535314894
Kurtosis18.50312575
Mean0.3978786645
Median Absolute Deviation (MAD)0.147
Skewness3.449076752
Sum1954.38
Variance0.3731609834
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
01710.8%
 
0.0011090.5%
 
0.002590.3%
 
0.007540.3%
 
0.006520.3%
 
0.004490.2%
 
0.008480.2%
 
0.005480.2%
 
0.014410.2%
 
0.013390.2%
 
0.009380.2%
 
0.012370.2%
 
0.003370.2%
 
0.011320.2%
 
0.017310.2%
 
0.015310.2%
 
0.021300.1%
 
0.019290.1%
 
0.022280.1%
 
0.026280.1%
 
0.01270.1%
 
0.023260.1%
 
0.03260.1%
 
0.059240.1%
 
0.056240.1%
 
Other values (1279)379418.5%
 
(Missing)1564476.1%
 
ValueCountFrequency (%) 
01710.8%
 
0.0011090.5%
 
0.002590.3%
 
0.003370.2%
 
0.004490.2%
 
0.005480.2%
 
0.006520.3%
 
0.007540.3%
 
0.008480.2%
 
0.009380.2%
 
ValueCountFrequency (%) 
7.2851< 0.1%
 
6.7081< 0.1%
 
5.7291< 0.1%
 
5.3571< 0.1%
 
5.141< 0.1%
 
5.0021< 0.1%
 
4.9111< 0.1%
 
4.8971< 0.1%
 
4.7241< 0.1%
 
4.5571< 0.1%
 

new_tests_smoothed
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count4106
Unique (%)67.9%
Missing14508
Missing (%)70.6%
Infinite0
Infinite (%)0.0%
Mean9595.70171957672
Minimum0.0
Maximum399846.0
Zeros32
Zeros (%)0.2%
Memory size160.7 KiB

Quantile statistics

Minimum0
5-th percentile36
Q1687.5
median2169.5
Q36079.25
95-th percentile40948.45
Maximum399846
Range399846
Interquartile range (IQR)5391.75

Descriptive statistics

Standard deviation30563.02969
Coefficient of variation (CV)3.185075003
Kurtosis66.87949197
Mean9595.70172
Median Absolute Deviation (MAD)1888.5
Skewness7.431160888
Sum58034804
Variance934098783.7
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0320.2%
 
2134250.1%
 
930250.1%
 
2111250.1%
 
4200.1%
 
2200.1%
 
2384200.1%
 
1190.1%
 
3180.1%
 
5150.1%
 
4632120.1%
 
37120.1%
 
27110.1%
 
7110.1%
 
3610< 0.1%
 
810< 0.1%
 
7210< 0.1%
 
259< 0.1%
 
299< 0.1%
 
589< 0.1%
 
459< 0.1%
 
308< 0.1%
 
498< 0.1%
 
238< 0.1%
 
168< 0.1%
 
Other values (4081)568527.7%
 
(Missing)1450870.6%
 
ValueCountFrequency (%) 
0320.2%
 
1190.1%
 
2200.1%
 
3180.1%
 
4200.1%
 
5150.1%
 
67< 0.1%
 
7110.1%
 
810< 0.1%
 
94< 0.1%
 
ValueCountFrequency (%) 
3998461< 0.1%
 
3896111< 0.1%
 
3877831< 0.1%
 
3863181< 0.1%
 
3861961< 0.1%
 
3833041< 0.1%
 
3776191< 0.1%
 
3691811< 0.1%
 
3619931< 0.1%
 
3544071< 0.1%
 

new_tests_smoothed_per_thousand
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
ZEROS

Distinct count1378
Unique (%)22.8%
Missing14508
Missing (%)70.6%
Infinite0
Infinite (%)0.0%
Mean0.36638839285714275
Minimum0.0
Maximum4.993
Zeros237
Zeros (%)1.2%
Memory size160.7 KiB

Quantile statistics

Minimum0
5-th percentile0.001
Q10.034
median0.159
Q30.51525
95-th percentile1.33465
Maximum4.993
Range4.993
Interquartile range (IQR)0.48125

Descriptive statistics

Standard deviation0.5311624173
Coefficient of variation (CV)1.449725012
Kurtosis14.87992995
Mean0.3663883929
Median Absolute Deviation (MAD)0.15
Skewness3.134983854
Sum2215.917
Variance0.2821335136
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
02371.2%
 
0.0011480.7%
 
0.002830.4%
 
0.006750.4%
 
0.007570.3%
 
0.01540.3%
 
0.008480.2%
 
0.003460.2%
 
0.004450.2%
 
0.005440.2%
 
0.009440.2%
 
0.025390.2%
 
0.011390.2%
 
0.014380.2%
 
0.016370.2%
 
0.013350.2%
 
0.124330.2%
 
0.015330.2%
 
0.036330.2%
 
0.019330.2%
 
0.012330.2%
 
0.051320.2%
 
0.285320.2%
 
0.021310.2%
 
0.02280.1%
 
Other values (1353)469122.8%
 
(Missing)1450870.6%
 
ValueCountFrequency (%) 
02371.2%
 
0.0011480.7%
 
0.002830.4%
 
0.003460.2%
 
0.004450.2%
 
0.005440.2%
 
0.006750.4%
 
0.007570.3%
 
0.008480.2%
 
0.009440.2%
 
ValueCountFrequency (%) 
4.9931< 0.1%
 
4.8941< 0.1%
 
4.7851< 0.1%
 
4.7711< 0.1%
 
4.6421< 0.1%
 
4.6041< 0.1%
 
4.4331< 0.1%
 
4.3871< 0.1%
 
4.3521< 0.1%
 
4.3051< 0.1%
 

tests_units
Categorical

MISSING

Distinct count5
Unique (%)0.1%
Missing13909
Missing (%)67.7%
Memory size160.7 KiB
tests performed
2626
people tested
1925
samples tested
1078
units unclear
936
inconsistent units (COVID Tracking Project)
 
82
ValueCountFrequency (%) 
tests performed262612.8%
 
people tested19259.4%
 
samples tested10785.2%
 
units unclear9364.6%
 
inconsistent units (COVID Tracking Project)820.4%
 
(Missing)1390967.7%
 

Length

Max length43
Median length3
Mean length6.661218136
Min length3

Overview of Unicode Properties

Unique unicode characters28
Unique unicode categories (?)5
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
n3010022.0%
 
e1991214.5%
 
a1600511.7%
 
t125229.1%
 
s115938.5%
 
p75545.5%
 
68935.0%
 
r63524.6%
 
d56294.1%
 
o47153.4%
 
l39392.9%
 
m37042.7%
 
f26261.9%
 
u19541.4%
 
i12640.9%
 
c11820.9%
 
(820.1%
 
C820.1%
 
O820.1%
 
V820.1%
 
I820.1%
 
D820.1%
 
T820.1%
 
k820.1%
 
g820.1%
 
Other values (3)2460.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter12929794.4%
 
Space Separator68935.0%
 
Uppercase Letter5740.4%
 
Open Punctuation820.1%
 
Close Punctuation820.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n3010023.3%
 
e1991215.4%
 
a1600512.4%
 
t125229.7%
 
s115939.0%
 
p75545.8%
 
r63524.9%
 
d56294.4%
 
o47153.6%
 
l39393.0%
 
m37042.9%
 
f26262.0%
 
u19541.5%
 
i12641.0%
 
c11820.9%
 
k820.1%
 
g820.1%
 
j820.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
6893100.0%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(82100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C8214.3%
 
O8214.3%
 
V8214.3%
 
I8214.3%
 
D8214.3%
 
T8214.3%
 
P8214.3%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)82100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin12987194.8%
 
Common70575.2%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n3010023.2%
 
e1991215.3%
 
a1600512.3%
 
t125229.6%
 
s115938.9%
 
p75545.8%
 
r63524.9%
 
d56294.3%
 
o47153.6%
 
l39393.0%
 
m37042.9%
 
f26262.0%
 
u19541.5%
 
i12641.0%
 
c11820.9%
 
C820.1%
 
O820.1%
 
V820.1%
 
I820.1%
 
D820.1%
 
T820.1%
 
k820.1%
 
g820.1%
 
P820.1%
 
j820.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
689397.7%
 
(821.2%
 
)821.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII136928100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
n3010022.0%
 
e1991214.5%
 
a1600511.7%
 
t125229.1%
 
s115938.5%
 
p75545.5%
 
68935.0%
 
r63524.6%
 
d56294.1%
 
o47153.4%
 
l39392.9%
 
m37042.7%
 
f26261.9%
 
u19541.4%
 
i12640.9%
 
c11820.9%
 
(820.1%
 
C820.1%
 
O820.1%
 
V820.1%
 
I820.1%
 
D820.1%
 
T820.1%
 
k820.1%
 
g820.1%
 
Other values (3)2460.2%
 

stringency_index
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count157
Unique (%)1.0%
Missing4309
Missing (%)21.0%
Infinite0
Infinite (%)0.0%
Mean55.562003446790186
Minimum0.0
Maximum100.0
Zeros2171
Zeros (%)10.6%
Memory size160.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q119.44
median68.52
Q383.33
95-th percentile96.3
Maximum100
Range100
Interquartile range (IQR)63.89

Descriptive statistics

Standard deviation34.11979596
Coefficient of variation (CV)0.6140850553
Kurtosis-1.262909198
Mean55.56200345
Median Absolute Deviation (MAD)20.83
Skewness-0.5238120069
Sum902715.87
Variance1164.160476
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0217110.6%
 
82.414392.1%
 
96.34202.0%
 
11.114192.0%
 
1003691.8%
 
73.153571.7%
 
19.443521.7%
 
85.193491.7%
 
93.523471.7%
 
90.743231.6%
 
78.73201.6%
 
5.563131.5%
 
13.893021.5%
 
81.482991.5%
 
84.262961.4%
 
77.782941.4%
 
79.632871.4%
 
75.932611.3%
 
87.042521.2%
 
2.782471.2%
 
87.962281.1%
 
94.442251.1%
 
80.562211.1%
 
88.892141.0%
 
86.112131.0%
 
Other values (132)672932.7%
 
(Missing)430921.0%
 
ValueCountFrequency (%) 
0217110.6%
 
2.782471.2%
 
5.563131.5%
 
8.331870.9%
 
10.19380.2%
 
11.114192.0%
 
12.044< 0.1%
 
13.893021.5%
 
14.815< 0.1%
 
15.745< 0.1%
 
ValueCountFrequency (%) 
1003691.8%
 
98.15320.2%
 
97.221440.7%
 
96.34202.0%
 
95.37170.1%
 
94.442251.1%
 
93.523471.7%
 
92.591630.8%
 
92.13360.2%
 
91.67900.4%
 

population
Real number (ℝ≥0)

Distinct count211
Unique (%)1.0%
Missing64
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean107377797.9893129
Minimum809.0
Maximum7794798729.0
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum809
5-th percentile48865
Q12225728
median9537642
Q334813867
95-th percentile212559409
Maximum7794798729
Range7794797920
Interquartile range (IQR)32588139

Descriptive statistics

Standard deviation684907353.4
Coefficient of variation (CV)6.378482016
Kurtosis114.1793924
Mean107377798
Median Absolute Deviation (MAD)9100159
Skewness10.52003979
Sum2.200385836e+12
Variance4.690980828e+17
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
377421571510.7%
 
254998811510.7%
 
291368081510.7%
 
27222911510.7%
 
1289327531510.7%
 
3310026471510.7%
 
54212421510.7%
 
86555411510.7%
 
86546181510.7%
 
512691831510.7%
 
107089821510.7%
 
1459344601510.7%
 
171348731510.7%
 
2125594091510.7%
 
41052681510.7%
 
58503431510.7%
 
604618281510.7%
 
100992701510.7%
 
13265391510.7%
 
973385831510.7%
 
678860041510.7%
 
90064001510.7%
 
1264764581510.7%
 
55407181510.7%
 
652735121510.7%
 
Other values (186)1671781.3%
 
ValueCountFrequency (%) 
809760.4%
 
3483560.3%
 
4999690.3%
 
15002640.3%
 
26221580.3%
 
30237640.3%
 
33691710.3%
 
339381500.7%
 
38137800.4%
 
38718660.3%
 
ValueCountFrequency (%) 
77947987291510.7%
 
14393237741510.7%
 
13800043851500.7%
 
3310026471510.7%
 
2735236211440.7%
 
2208923311460.7%
 
2125594091510.7%
 
2061395871400.7%
 
164689383870.4%
 
1459344601510.7%
 

population_density
Real number (ℝ≥0)

MISSING

Distinct count199
Unique (%)1.0%
Missing910
Missing (%)4.4%
Infinite0
Infinite (%)0.0%
Mean425.76778163493844
Minimum0.13699999999999998
Maximum19347.5
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum0.137
5-th percentile4.289
Q141.285
median93.105
Q3227.322
95-th percentile1209.088
Maximum19347.5
Range19347.363
Interquartile range (IQR)186.037

Descriptive statistics

Standard deviation1841.393166
Coefficient of variation (CV)4.324876718
Kurtosis79.04204893
Mean425.7677816
Median Absolute Deviation (MAD)68.387
Skewness8.488692415
Sum8364633.838
Variance3390728.792
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
205.8591510.7%
 
508.5441510.7%
 
272.8981510.7%
 
4.0371510.7%
 
18.1361510.7%
 
58.0451510.7%
 
14.4621510.7%
 
527.9671510.7%
 
122.5781510.7%
 
31.0331510.7%
 
204.431510.7%
 
214.2431510.7%
 
237.0161510.7%
 
35.6081510.7%
 
83.4791510.7%
 
3.2021510.7%
 
45.1351510.7%
 
73.7261510.7%
 
24.7181510.7%
 
136.521510.7%
 
375.5641510.7%
 
7915.7311510.7%
 
402.6061510.7%
 
49.8311510.7%
 
137.1761510.7%
 
Other values (174)1587177.2%
 
(Missing)9104.4%
 
ValueCountFrequency (%) 
0.137710.3%
 
1.98750.4%
 
3.078760.4%
 
3.2021510.7%
 
3.4041510.7%
 
3.612710.3%
 
3.623660.3%
 
3.952760.4%
 
4.0371510.7%
 
4.044590.3%
 
ValueCountFrequency (%) 
19347.51390.7%
 
7915.7311510.7%
 
7039.7141170.6%
 
3457.1710.3%
 
1935.9071500.7%
 
1454.433810.4%
 
1454.037840.4%
 
1308.82710.3%
 
1265.036870.4%
 
1209.088650.3%
 

median_age
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count133
Unique (%)0.7%
Missing1863
Missing (%)9.1%
Infinite0
Infinite (%)0.0%
Mean32.34845129192747
Minimum15.1
Maximum48.2
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum15.1
5-th percentile18
Q125.3
median32.4
Q340.8
95-th percentile44.8
Maximum48.2
Range33.1
Interquartile range (IQR)15.5

Descriptive statistics

Standard deviation8.960297524
Coefficient of variation (CV)0.2769930914
Kurtosis-1.166204765
Mean32.34845129
Median Absolute Deviation (MAD)7.4
Skewness-0.161912858
Sum604689.6
Variance80.28693172
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
32.45252.6%
 
38.74492.2%
 
31.93161.5%
 
30.63101.5%
 
29.13011.5%
 
37.93001.5%
 
39.72951.4%
 
29.32951.4%
 
41.22451.2%
 
41.82391.2%
 
43.42381.2%
 
42.42351.1%
 
202341.1%
 
37.32301.1%
 
41.42281.1%
 
43.12271.1%
 
28.22251.1%
 
28.62231.1%
 
252181.1%
 
18.72171.1%
 
42.22161.1%
 
39.12151.0%
 
27.62141.0%
 
17.72141.0%
 
20.32021.0%
 
Other values (108)1208258.8%
 
(Missing)18639.1%
 
ValueCountFrequency (%) 
15.1700.3%
 
16.41340.7%
 
16.7710.3%
 
16.81430.7%
 
17780.4%
 
17.51320.6%
 
17.6780.4%
 
17.72141.0%
 
18690.3%
 
18.11971.0%
 
ValueCountFrequency (%) 
48.21510.7%
 
47.91510.7%
 
46.61510.7%
 
46.2900.4%
 
45.51500.7%
 
45.31510.7%
 
44.81170.6%
 
44.7810.4%
 
44.5850.4%
 
44.41510.7%
 

aged_65_older
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count183
Unique (%)1.0%
Missing2115
Missing (%)10.3%
Infinite0
Infinite (%)0.0%
Mean9.869730491838839
Minimum1.1440000000000001
Maximum27.049
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum1.144
5-th percentile2.405
Q14.029
median7.775
Q315.322
95-th percentile20.396
Maximum27.049
Range25.905
Interquartile range (IQR)11.293

Descriptive statistics

Standard deviation6.468237517
Coefficient of variation (CV)0.6553611087
Kurtosis-1.050982889
Mean9.869730492
Median Absolute Deviation (MAD)4.796
Skewness0.5009411594
Sum182007.7
Variance41.83809658
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
27.0491510.7%
 
19.6771510.7%
 
19.0271510.7%
 
5.8091510.7%
 
19.2021510.7%
 
23.0211510.7%
 
12.9221510.7%
 
15.4131510.7%
 
14.1781510.7%
 
18.5171510.7%
 
6.2931510.7%
 
13.9141510.7%
 
11.7331510.7%
 
16.8211510.7%
 
8.6961510.7%
 
14.7991510.7%
 
21.2281510.7%
 
19.9851510.7%
 
6.8571510.7%
 
16.9841510.7%
 
18.5711510.7%
 
5.441510.7%
 
18.4361510.7%
 
19.7241510.7%
 
7.151510.7%
 
Other values (158)1466671.3%
 
(Missing)211510.3%
 
ValueCountFrequency (%) 
1.1441450.7%
 
1.3071470.7%
 
2.168690.3%
 
2.339730.4%
 
2.3451480.7%
 
2.3551450.7%
 
2.3721500.7%
 
2.405690.3%
 
2.409780.4%
 
2.48720.4%
 
ValueCountFrequency (%) 
27.0491510.7%
 
23.0211510.7%
 
21.502900.4%
 
21.4531510.7%
 
21.2281510.7%
 
20.801810.4%
 
20.3961510.7%
 
19.9851510.7%
 
19.754910.4%
 
19.7241510.7%
 

aged_70_older
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count182
Unique (%)1.0%
Missing1957
Missing (%)9.5%
Infinite0
Infinite (%)0.0%
Mean6.284985751922146
Minimum0.526
Maximum18.493
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum0.526
5-th percentile1.38
Q12.361
median4.832
Q39.842
95-th percentile13.799
Maximum18.493
Range17.967
Interquartile range (IQR)7.481

Descriptive statistics

Standard deviation4.442501178
Coefficient of variation (CV)0.7068434764
Kurtosis-0.7919614551
Mean6.284985752
Median Absolute Deviation (MAD)3.049
Skewness0.6374421157
Sum116894.45
Variance19.73581672
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3.0532571.3%
 
1.8451610.8%
 
2.0631520.7%
 
9.7881510.7%
 
9.2071510.7%
 
11.8811510.7%
 
13.4331510.7%
 
12.5271510.7%
 
9.3931510.7%
 
3.1821510.7%
 
15.9571510.7%
 
8.6221510.7%
 
3.4071510.7%
 
16.241510.7%
 
13.4911510.7%
 
14.5241510.7%
 
9.7321510.7%
 
13.2641510.7%
 
13.0791510.7%
 
4.7181510.7%
 
12.8491510.7%
 
13.0531510.7%
 
10.7971510.7%
 
11.581510.7%
 
3.2121510.7%
 
Other values (157)1470771.5%
 
(Missing)19579.5%
 
ValueCountFrequency (%) 
0.5261450.7%
 
0.6171470.7%
 
1.1141480.7%
 
1.285590.3%
 
1.308690.3%
 
1.3371410.7%
 
1.358780.4%
 
1.362690.3%
 
1.378700.3%
 
1.38340.2%
 
ValueCountFrequency (%) 
18.4931510.7%
 
16.241510.7%
 
15.9571510.7%
 
14.924900.4%
 
14.5241510.7%
 
14.136910.4%
 
13.7991500.7%
 
13.7781510.7%
 
13.7481510.7%
 
13.4911510.7%
 

gdp_per_capita
Real number (ℝ≥0)

MISSING

Distinct count183
Unique (%)1.0%
Missing2122
Missing (%)10.3%
Infinite0
Infinite (%)0.0%
Mean23160.372423890636
Minimum661.24
Maximum116935.6
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum661.24
5-th percentile1561.767
Q16462.531
median15807.374
Q335938.374
95-th percentile65530.537
Maximum116935.6
Range116274.36
Interquartile range (IQR)29475.843

Descriptive statistics

Standard deviation21355.07134
Coefficient of variation (CV)0.9220521566
Kurtosis2.733729647
Mean23160.37242
Median Absolute Deviation (MAD)11910.473
Skewness1.486745566
Sum426938305.3
Variance456039072
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2442.8041510.7%
 
39002.2231510.7%
 
85535.3831510.7%
 
45436.6861510.7%
 
15469.2071510.7%
 
15308.7121510.7%
 
44648.711510.7%
 
46482.9581510.7%
 
26808.1641510.7%
 
24765.9541510.7%
 
48472.5451510.7%
 
46682.5151510.7%
 
29481.2521510.7%
 
39753.2441510.7%
 
32605.9061510.7%
 
17336.4691510.7%
 
64800.0571510.7%
 
45229.2451510.7%
 
35220.0841510.7%
 
57410.1661510.7%
 
42658.5761510.7%
 
14103.4521510.7%
 
46949.2831510.7%
 
38605.6711510.7%
 
35938.3741510.7%
 
Other values (158)1465971.3%
 
(Missing)212210.3%
 
ValueCountFrequency (%) 
661.24750.4%
 
702.225590.3%
 
752.788740.4%
 
808.133780.4%
 
926700.3%
 
1095.042570.3%
 
1136.103680.3%
 
1390.3590.3%
 
1413.89280.1%
 
1416.44700.3%
 
ValueCountFrequency (%) 
116935.61470.7%
 
94277.9651440.7%
 
85535.3831510.7%
 
71809.251800.4%
 
67335.2931500.7%
 
67293.4831450.7%
 
65530.5371480.7%
 
64800.0571510.7%
 
57410.1661510.7%
 
56861.471500.7%
 

extreme_poverty
Real number (ℝ≥0)

MISSING

Distinct count73
Unique (%)0.6%
Missing8331
Missing (%)40.5%
Infinite0
Infinite (%)0.0%
Mean10.198298568507157
Minimum0.1
Maximum77.6
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum0.1
5-th percentile0.1
Q10.5
median1.5
Q310
95-th percentile52.2
Maximum77.6
Range77.5
Interquartile range (IQR)9.5

Descriptive statistics

Standard deviation17.60508628
Coefficient of variation (CV)1.726276806
Kurtosis3.790537814
Mean10.19829857
Median Absolute Deviation (MAD)1.3
Skewness2.125502709
Sum124674.2
Variance309.9390629
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.216718.1%
 
0.512246.0%
 
0.79204.5%
 
0.17553.7%
 
14182.0%
 
23841.9%
 
1.33161.5%
 
1.53091.5%
 
2.53001.5%
 
5.72931.4%
 
3.42281.1%
 
1.11580.8%
 
2.21540.7%
 
1.21510.7%
 
151510.7%
 
101510.7%
 
21.21500.7%
 
4.21480.7%
 
41460.7%
 
3.61460.7%
 
1.41430.7%
 
1.81420.7%
 
51420.7%
 
1.61420.7%
 
18.91130.5%
 
Other values (48)337016.4%
 
(Missing)833140.5%
 
ValueCountFrequency (%) 
0.17553.7%
 
0.216718.1%
 
0.512246.0%
 
0.6840.4%
 
0.79204.5%
 
14182.0%
 
1.11580.8%
 
1.21510.7%
 
1.33161.5%
 
1.41430.7%
 
ValueCountFrequency (%) 
77.6700.3%
 
77.1780.4%
 
71.7590.3%
 
71.4570.3%
 
67.1640.3%
 
62.9680.3%
 
59.6150.1%
 
57.5720.4%
 
56760.4%
 
52.2590.3%
 

cvd_death_rate
Real number (ℝ≥0)

MISSING

Distinct count186
Unique (%)1.0%
Missing1944
Missing (%)9.5%
Infinite0
Infinite (%)0.0%
Mean245.31734053298948
Minimum79.37
Maximum724.4169999999999
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum79.37
5-th percentile99.403
Q1151.089
median233.07
Q3311.11
95-th percentile460.043
Maximum724.417
Range645.047
Interquartile range (IQR)160.021

Descriptive statistics

Standard deviation119.001868
Coefficient of variation (CV)0.4850935842
Kurtosis0.8065004189
Mean245.3173405
Median Absolute Deviation (MAD)81.981
Skewness0.9385523442
Sum4565846.342
Variance14161.44458
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
233.071510.7%
 
79.371510.7%
 
151.0891510.7%
 
177.9611510.7%
 
133.9821510.7%
 
114.7671510.7%
 
253.7821510.7%
 
260.9421510.7%
 
113.1511510.7%
 
93.321510.7%
 
114.8981510.7%
 
92.2431510.7%
 
443.1291510.7%
 
245.4651510.7%
 
109.3611510.7%
 
114.3161510.7%
 
107.7911510.7%
 
86.061510.7%
 
261.8991510.7%
 
152.7831510.7%
 
342.9891510.7%
 
153.5071510.7%
 
260.7971510.7%
 
122.1371510.7%
 
227.4851510.7%
 
Other values (161)1483772.2%
 
(Missing)19449.5%
 
ValueCountFrequency (%) 
79.371510.7%
 
85.755870.4%
 
85.9981510.7%
 
86.061510.7%
 
92.2431510.7%
 
93.321510.7%
 
99.4031500.7%
 
99.7391510.7%
 
103.9571500.7%
 
105.5991510.7%
 
ValueCountFrequency (%) 
724.417750.4%
 
597.0291410.7%
 
561.494700.3%
 
559.8121440.7%
 
539.849770.4%
 
525.4321470.7%
 
496.2181480.7%
 
495.003500.2%
 
466.792780.4%
 
460.043750.4%
 

diabetes_prevalence
Real number (ℝ≥0)

MISSING

Distinct count141
Unique (%)0.7%
Missing1259
Missing (%)6.1%
Infinite0
Infinite (%)0.0%
Mean8.00911333367881
Minimum0.99
Maximum23.36
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum0.99
5-th percentile2.42
Q15.31
median7.11
Q310.08
95-th percentile16.52
Maximum23.36
Range22.37
Interquartile range (IQR)4.77

Descriptive statistics

Standard deviation4.029353695
Coefficient of variation (CV)0.5030961015
Kurtosis1.496405661
Mean8.009113334
Median Absolute Deviation (MAD)2.3
Skewness1.133265288
Sum154551.86
Variance16.2356912
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2.429894.8%
 
7.116903.4%
 
10.084692.3%
 
3.944352.1%
 
11.624222.1%
 
5.313021.5%
 
5.593021.5%
 
9.743001.5%
 
16.522971.4%
 
9.592921.4%
 
6.052741.3%
 
8.832331.1%
 
5.722321.1%
 
7.22301.1%
 
4.282271.1%
 
6.12241.1%
 
42081.0%
 
8.331991.0%
 
8.271600.8%
 
6.181510.7%
 
7.261510.7%
 
13.061510.7%
 
8.111510.7%
 
16.741510.7%
 
61510.7%
 
Other values (116)1190657.9%
 
(Missing)12596.1%
 
ValueCountFrequency (%) 
0.99740.4%
 
1.82700.3%
 
1.91730.4%
 
2.16710.3%
 
2.429894.8%
 
2.5690.3%
 
2.92850.4%
 
3.281500.7%
 
3.3680.3%
 
3.671510.7%
 
ValueCountFrequency (%) 
23.36700.3%
 
22.63720.4%
 
22.02710.3%
 
21.52720.4%
 
17.72850.4%
 
17.65700.3%
 
17.311470.7%
 
17.261450.7%
 
17.11670.3%
 
16.741510.7%
 

female_smokers
Real number (ℝ≥0)

MISSING

Distinct count107
Unique (%)0.7%
Missing5405
Missing (%)26.3%
Infinite0
Infinite (%)0.0%
Mean11.310047785624711
Minimum0.1
Maximum44.0
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum0.1
5-th percentile0.4
Q11.9
median7
Q319.8
95-th percentile30.2
Maximum44
Range43.9
Interquartile range (IQR)17.9

Descriptive statistics

Standard deviation10.56988318
Coefficient of variation (CV)0.9345568986
Kurtosis-0.5626221244
Mean11.31004779
Median Absolute Deviation (MAD)6.1
Skewness0.7661125146
Sum171358.534
Variance111.7224305
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1.95182.5%
 
24452.2%
 
0.84112.0%
 
13891.9%
 
0.33631.8%
 
2.82901.4%
 
30.12321.1%
 
1.22301.1%
 
19.62301.1%
 
20.92281.1%
 
5.32261.1%
 
1.72231.1%
 
0.72191.1%
 
0.22161.1%
 
1.52161.1%
 
0.62141.0%
 
1.62131.0%
 
0.41810.9%
 
4.71620.8%
 
7.11580.8%
 
15.41510.7%
 
35.31510.7%
 
24.51510.7%
 
121510.7%
 
25.11510.7%
 
Other values (82)903243.9%
 
(Missing)540526.3%
 
ValueCountFrequency (%) 
0.1700.3%
 
0.22161.1%
 
0.33631.8%
 
0.41810.9%
 
0.51450.7%
 
0.62141.0%
 
0.72191.1%
 
0.84112.0%
 
0.9770.4%
 
13891.9%
 
ValueCountFrequency (%) 
44730.4%
 
37.7940.5%
 
35.31510.7%
 
34.31510.7%
 
34.2860.4%
 
30.51510.7%
 
30.2780.4%
 
30.12321.1%
 
29770.4%
 
28.41510.7%
 

male_smokers
Real number (ℝ≥0)

MISSING

Distinct count122
Unique (%)0.8%
Missing5569
Missing (%)27.1%
Infinite0
Infinite (%)0.0%
Mean32.63865917128177
Minimum7.7
Maximum78.1
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum7.7
5-th percentile13.5
Q121.4
median31.4
Q340.8
95-th percentile53.3
Maximum78.1
Range70.4
Interquartile range (IQR)19.4

Descriptive statistics

Standard deviation13.21017567
Coefficient of variation (CV)0.4047401458
Kurtosis0.4076086274
Mean32.63865917
Median Absolute Deviation (MAD)9.5
Skewness0.5597867572
Sum489155.585
Variance174.5087413
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
24.73551.7%
 
31.43011.5%
 
16.52971.4%
 
33.72931.4%
 
16.62401.2%
 
33.12391.2%
 
30.92311.1%
 
37.82281.1%
 
18.82231.1%
 
12.32201.1%
 
40.72181.1%
 
34.81580.8%
 
20.41580.8%
 
40.91510.7%
 
21.41510.7%
 
42.41510.7%
 
17.91510.7%
 
18.91510.7%
 
20.71510.7%
 
45.91510.7%
 
39.31510.7%
 
28.31510.7%
 
15.21510.7%
 
381510.7%
 
58.31510.7%
 
Other values (97)1001448.7%
 
(Missing)556927.1%
 
ValueCountFrequency (%) 
7.7770.4%
 
8.5770.4%
 
9.9820.4%
 
10.81400.7%
 
11.4690.3%
 
12.32201.1%
 
13.5860.4%
 
14.2770.4%
 
14.5730.4%
 
15.21510.7%
 
ValueCountFrequency (%) 
78.1690.3%
 
76.11440.7%
 
65.8820.4%
 
58.31510.7%
 
55.51480.7%
 
55810.4%
 
53.9150.1%
 
53.3760.4%
 
52.7790.4%
 
52.3750.4%
 

handwashing_facilities
Real number (ℝ≥0)

MISSING

Distinct count92
Unique (%)1.1%
Missing12421
Missing (%)60.4%
Infinite0
Infinite (%)0.0%
Mean55.26476140135218
Minimum1.188
Maximum98.999
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum1.188
5-th percentile6.144
Q124.64
median59.607
Q383.841
95-th percentile95.803
Maximum98.999
Range97.811
Interquartile range (IQR)59.201

Descriptive statistics

Standard deviation30.95356979
Coefficient of variation (CV)0.5600959636
Kurtosis-1.358829925
Mean55.2647614
Median Absolute Deviation (MAD)27.372
Skewness-0.2613331165
Sum449578.834
Variance958.1234829
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
87.8471510.7%
 
85.8471510.7%
 
47.7821510.7%
 
60.131510.7%
 
59.551500.7%
 
94.5761490.7%
 
78.4631470.7%
 
89.8271470.7%
 
80.6351460.7%
 
83.7411460.7%
 
59.6071460.7%
 
97.41450.7%
 
90.671440.7%
 
83.2411440.7%
 
64.2041440.7%
 
55.1821420.7%
 
94.0431420.7%
 
66.2291420.7%
 
37.7461410.7%
 
41.9491400.7%
 
43.9931130.5%
 
97.719940.5%
 
20.859890.4%
 
34.808870.4%
 
65.386860.4%
 
Other values (67)474723.1%
 
(Missing)1242160.4%
 
ValueCountFrequency (%) 
1.188740.4%
 
2.117150.1%
 
2.735780.4%
 
4.472780.4%
 
4.617760.4%
 
5.818710.3%
 
6.144590.3%
 
6.403640.3%
 
7.876730.4%
 
7.96770.4%
 
ValueCountFrequency (%) 
98.999780.4%
 
97.719940.5%
 
97.41450.7%
 
97.164780.4%
 
95.803810.4%
 
94.5761490.7%
 
94.0431420.7%
 
90.671440.7%
 
90.65720.4%
 
90.083670.3%
 

hospital_beds_per_100k
Real number (ℝ≥0)

MISSING

Distinct count100
Unique (%)0.6%
Missing3392
Missing (%)16.5%
Infinite0
Infinite (%)0.0%
Mean3.227052318806805
Minimum0.1
Maximum13.8
Zeros0
Zeros (%)0.0%
Memory size160.7 KiB

Quantile statistics

Minimum0.1
5-th percentile0.5
Q11.4
median2.6
Q34.28
95-th percentile8
Maximum13.8
Range13.7
Interquartile range (IQR)2.88

Descriptive statistics

Standard deviation2.608836739
Coefficient of variation (CV)0.8084271593
Kurtosis3.733886779
Mean3.227052319
Median Absolute Deviation (MAD)1.3
Skewness1.762623971
Sum55389.126
Variance6.806029131
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1.65972.9%
 
0.85152.5%
 
0.74862.4%
 
1.34342.1%
 
1.54312.1%
 
1.43841.9%
 
0.33741.8%
 
23701.8%
 
3.63691.8%
 
2.63661.8%
 
2.13661.8%
 
2.53021.5%
 
2.33001.5%
 
1.92971.4%
 
1.22921.4%
 
0.92911.4%
 
3.82891.4%
 
0.52841.4%
 
2.92711.3%
 
1.12651.3%
 
1.72251.1%
 
0.62221.1%
 
12221.1%
 
2.21790.9%
 
2.71650.8%
 
Other values (75)886843.1%
 
(Missing)339216.5%
 
ValueCountFrequency (%) 
0.1650.3%
 
0.2700.3%
 
0.33741.8%
 
0.4780.4%
 
0.52841.4%
 
0.531500.7%
 
0.62221.1%
 
0.74862.4%
 
0.85152.5%
 
0.92911.4%
 
ValueCountFrequency (%) 
13.81390.7%
 
13.051510.7%
 
12.271510.7%
 
111510.7%
 
8.8770.4%
 
8.051510.7%
 
81510.7%
 
7.454810.4%
 
7.371510.7%
 
7.02870.4%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

Sample

First rows

iso_codelocationdatetotal_casesnew_casestotal_deathsnew_deathstotal_cases_per_millionnew_cases_per_milliontotal_deaths_per_millionnew_deaths_per_milliontotal_testsnew_teststotal_tests_per_thousandnew_tests_per_thousandnew_tests_smoothednew_tests_smoothed_per_thousandtests_unitsstringency_indexpopulationpopulation_densitymedian_ageaged_65_olderaged_70_oldergdp_per_capitaextreme_povertycvd_death_ratediabetes_prevalencefemale_smokersmale_smokershandwashing_facilitieshospital_beds_per_100k
0ABWAruba2020-03-13220018.73318.7330.00.0NaNNaNNaNNaNNaNNaNNaN0.00106766.0584.841.213.0857.45235973.781NaNNaN11.62NaNNaNNaNNaN
1ABWAruba2020-03-20420037.46518.7330.00.0NaNNaNNaNNaNNaNNaNNaN30.56106766.0584.841.213.0857.45235973.781NaNNaN11.62NaNNaNNaNNaN
2ABWAruba2020-03-2412800112.39574.9300.00.0NaNNaNNaNNaNNaNNaNNaN41.67106766.0584.841.213.0857.45235973.781NaNNaN11.62NaNNaNNaNNaN
3ABWAruba2020-03-2517500159.22746.8310.00.0NaNNaNNaNNaNNaNNaNNaN41.67106766.0584.841.213.0857.45235973.781NaNNaN11.62NaNNaNNaNNaN
4ABWAruba2020-03-2619200177.95918.7330.00.0NaNNaNNaNNaNNaNNaNNaN41.67106766.0584.841.213.0857.45235973.781NaNNaN11.62NaNNaNNaNNaN
5ABWAruba2020-03-2728900262.25684.2960.00.0NaNNaNNaNNaNNaNNaNNaN41.67106766.0584.841.213.0857.45235973.781NaNNaN11.62NaNNaNNaNNaN
6ABWAruba2020-03-2828000262.2560.0000.00.0NaNNaNNaNNaNNaNNaNNaN41.67106766.0584.841.213.0857.45235973.781NaNNaN11.62NaNNaNNaNNaN
7ABWAruba2020-03-2928000262.2560.0000.00.0NaNNaNNaNNaNNaNNaNNaN82.41106766.0584.841.213.0857.45235973.781NaNNaN11.62NaNNaNNaNNaN
8ABWAruba2020-03-30502200468.314206.0580.00.0NaNNaNNaNNaNNaNNaNNaN82.41106766.0584.841.213.0857.45235973.781NaNNaN11.62NaNNaNNaNNaN
9ABWAruba2020-04-0155500515.14546.8310.00.0NaNNaNNaNNaNNaNNaNNaN82.41106766.0584.841.213.0857.45235973.781NaNNaN11.62NaNNaNNaNNaN

Last rows

iso_codelocationdatetotal_casesnew_casestotal_deathsnew_deathstotal_cases_per_millionnew_cases_per_milliontotal_deaths_per_millionnew_deaths_per_milliontotal_testsnew_teststotal_tests_per_thousandnew_tests_per_thousandnew_tests_smoothednew_tests_smoothed_per_thousandtests_unitsstringency_indexpopulationpopulation_densitymedian_ageaged_65_olderaged_70_oldergdp_per_capitaextreme_povertycvd_death_ratediabetes_prevalencefemale_smokersmale_smokershandwashing_facilitieshospital_beds_per_100k
20546NaNInternational2020-02-23634020NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
20547NaNInternational2020-02-246915731NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
20548NaNInternational2020-02-25691030NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
20549NaNInternational2020-02-26691041NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
20550NaNInternational2020-02-277051440NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
20551NaNInternational2020-02-28705040NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
20552NaNInternational2020-02-29705062NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
20553NaNInternational2020-03-01705060NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
20554NaNInternational2020-03-02705060NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
20555NaNInternational2020-03-10696-971NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN